SLIM: a sliding linear model for estimating the proportion of true null hypotheses in datasets with dependence structures

Authors

  • Hong-Qiang Wang
  • Lindsey K. Tuominen
  • Chung-Jui Tsai
Abstract

MOTIVATION: Estimating the proportion of true null hypotheses (π0) in advance plays a critical role in controlling the false discovery rate (FDR) in multiple hypothesis testing. However, the hidden, complex dependence structures of many genomics datasets distort the distribution of p-values, rendering existing π0 estimators less effective.

RESULTS: Starting from the basic non-linear model of the q-value method, we developed a simple linear algorithm to probe local dependence blocks. We uncovered a non-static relationship between tests' p-values and their corresponding q-values that is influenced by data structure and π0. Using an optimization framework, these findings were exploited to devise a Sliding Linear Model (SLIM) that estimates π0 more reliably under dependence. When tested on a number of simulated datasets with varying dependence structures and on microarray data, SLIM was found to be robust to dependence when estimating π0. The accuracy of its π0 estimation suggests that SLIM can be used as a stand-alone tool for predicting significant tests.

AVAILABILITY: The R code of the proposed method is available at http://aspendb.uga.edu/downloads for academic use.
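For orientation, the kind of baseline estimator the abstract refers to can be sketched as follows. This is not SLIM and not the authors' R code: it is a minimal Python sketch of the standard fixed-threshold (Storey-type) estimate of the proportion of true null hypotheses, #{p > λ} / (m(1 − λ)), which assumes null p-values are uniform and roughly independent. The function names `storey_pi0` and `simulate_pvalues`, the choice λ = 0.5, and the simulated mixture are all illustrative assumptions.

```python
import random


def storey_pi0(pvalues, lam=0.5):
    """Fixed-lambda estimate of the null proportion.

    Counts p-values above lam and rescales by the width of (lam, 1],
    which is where uniformly distributed null p-values dominate.
    The result is capped at 1.
    """
    if not 0.0 <= lam < 1.0:
        raise ValueError("lam must be in [0, 1)")
    m = len(pvalues)
    if m == 0:
        raise ValueError("pvalues must be non-empty")
    tail = sum(1 for p in pvalues if p > lam)
    return min(1.0, tail / (m * (1.0 - lam)))


def simulate_pvalues(m=10000, pi0=0.8, seed=0):
    """Hypothetical independent mixture: uniform nulls plus small-p alternatives."""
    rng = random.Random(seed)
    m0 = int(round(m * pi0))
    nulls = [rng.random() for _ in range(m0)]
    # u ** 5 follows a Beta(0.2, 1) law, so alternatives concentrate near 0.
    alts = [rng.random() ** 5 for _ in range(m - m0)]
    return nulls + alts


if __name__ == "__main__":
    pvals = simulate_pvalues()
    print("estimated null proportion:", storey_pi0(pvals, lam=0.5))
```

On independent simulated data like this, the estimate tends to sit slightly above the true proportion, because some alternative p-values also fall above λ. Under dependence the p-value histogram can be distorted, which is the situation the abstract says degrades estimators of this kind.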


Similar resources

Sensorless Indirect Field Oriented Control of Single-sided Linear Induction Motor With a Novel Sliding Mode MRAS Speed Estimator

This paper proposes a new sliding mode control (SMC) based model reference adaptive system (MRAS) for sensorless indirect field oriented control (IFOC) of a single-sided linear induction motor (SLIM). The operation of MRAS speed estimators depends heavily on the adaptation mechanism. A fixed-gain PI controller is conventionally used for this purpose, which may fail to estimate the speed correctl...


Estimating the proportion of true null hypotheses, with application to DNA microarray data

We consider the problem of estimating the proportion of true null hypotheses, π0, in a multiple-hypothesis set-up. The tests are based on observed p-values. We first review published estimators based on the estimator that was suggested by Schweder and Spjøtvoll. Then we derive new estimators based on nonparametric maximum likelihood estimation of the p-value density, restricting to decreasing an...


Estimating the Proportion of True Null Hypotheses under Dependence

Multiple testing procedures, such as the False Discovery Rate control, often rely on estimating the proportion of true null hypotheses. This proportion is directly related to the minimum of the density of the p-value distribution. We propose a new estimator for the minimum of a density that is based on constrained multinomial likelihood functions. The proposed method involves partitioning the s...


Investigation of SLIM Dynamic Models Based on Vector Control for Railway Applications

Although the use of Single-Sided Linear Induction Motors (SLIMs) has increased in railway applications due to their numerous advantages over Rotational Induction Motors (RIMs), their mathematical models and electrical drives involve some complications. This paper focuses on the problems of SLIM modeling, assuming end-effect on the basis of Field Oriented Control (FOC) as a ...


Estimating the Proportion of Nonzero Normal Means under Certain Strong Covariance Dependence

The proportion of a certain type of hypotheses is a key component of adaptive false discovery procedures in multiple testing. To date, a good estimator of the proportion of false null hypotheses under dependence is lacking. For multiple testing of normal means, we develop a (uniformly) consistent estimator of the proportion of nonzero normal means when the dependent test statistics follow a joint no...



Journal title:
  • Bioinformatics

Volume 27, Issue 2

Pages: -

Publication date: 2011